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Abstract 

Optimization problems over compact Lie groups have been extensively studied due to their 
broad applications in linear programming and optimal control. This paper analyzes least square 
problems over a noncompact Lie group, the symplectic group Sp(2A^, R), which can be used 
to assess the optimality of control over dynamical transformations in classical mechanics and 
quantum optics. The critical topology for minimizing the Frobenius distance from a target 
symplectic transformation is solved. It is shown that the critical points include a unique local 
minimum and a number of saddle points. The topology is more complicated than those of 
previously studied problems on compact Lie groups such as the orthogonal and unitary groups 
because the incompatibility of the Frobenius norm with the pseudo-Riemannian structure on 
the symplectic group brings significant nonlinearity to the problem. Nonetheless, the lack of 
traps guarantees the global convergence of local optimization algorithms. 
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I. INTRODUCTION 



The topology of solution sets to problems in the calculus of variations is the subject 
of considerable interest in mathematical physics and optimization theory It is of 
particula. i,„pona.ce i„ theory of op.„„a. con..,, whe. this topology can affect the 
efficiency of the search for effective control Hamiltonians [2y. Whereas in general it is 
very difficult to characterize these features for arbitrary functionals, when the objective 
or Lagrangian functional is defined on a Lie group, it is often possible to apply techniques 
from the theory of Lie groups and differential geometry to simplify the extraction of 
critical topology. 

The topology of the critical submanifolds of classical Lie groups was originally studied 
by Frankel 3(], who characterized the number of critical points and associated Morse 
indices of the trace function on compact classical Lie groups U{N), 0{N), and Sp(A^). 
Dynnikov and Vesselov |J] subsequently identified these functions as perfect Morse-Bott 
functions and showed that they afford a cell decomposition of the associated groups. 
Recently, the equivalence of the trace function to that of a least-square matrix function 
for the distance between a real and target transformation led to the application of these 
results to optimization and control theory. Brockett [S] showed that a wide range of 
combinatorial optimization problems arising in linear programming can be framed as 
matrix least squares optimizations on compact Lie groups. In [g] , the critical topology of 
the trace function on U{N) was analyzed in light of its connection to the optimal control 
problem of implementing a quantum logic gate over discrete variables with maximal 
fidelity. 

A unifying feature of these problems is the fact that the domain of the objective 
functional, being a compact Lie group, can always be endowed with the structure of 
a differential manifold with a bi-invariant Riemannian metric. In this paper, we at- 
tempt to extend such studies to the investigation of critical submanifolds of least squares 
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objective functions on noncompact Lie groups 3], in particular the symplectic group 
Sp(2A^, M). Although the geometry and topology of symplectic manifolds, and functions 
defined on those manifolds, have been the subject of extensive investigations in math- 
ematical physics, functions defined on the symplectic group itself have received far less 
attention. 

Specifically, we are concerned with the least-square distance function on the space of 
symplectic matrices, 

J{S) = \\S -W\\^ = Tt{S -Wf{S -W), SeSp{2N,R). (1) 

This cost function has recently been shown to have fundamental applications in the 
assessment of the fidelity of dynamical gates in quantum analog computation when im- 
plemented through optimal control theory [8!], where W represents the target quantum 
gate to be realized. Another potential important motivation for studying this problem 
comes from the control of beam systems in particle accelerators [9]. As shown before, 
the cost function ([1]) on compact Lie groups (e.g., U{N), 0{N)) is equivalent to a linear 
trace function. However, this no longer holds on the symplectic group, because the corre- 
sponding Riemannian metric is not bi-invariant under symplectic transformations. This 
feature is caused by incompatibility of the Frobenius norm defined in ([1]) with the geomet- 
ric structure of the symplectic group, and greatly complexifies the critical topology, as we 
will show below. On the other hand, this group can be treated as a pseudo-Riemannian 
manifold with a bi-invariant pseudo-Riemannian metric. Although it is possible to intro- 
duce an objective function that is compatible with this pseudo-Riemannian metric |lol |. 
such function is not positive definite and cannot be interpreted as a distance function. 
However, the corresponding critical topology is equivalent to that of a linear trace func- 
tion on Sp(2A^, M), as well as those on U{N) and 0{N). In contrast to the objective 
functionals ([1]), such compatibility leads to a simple critical topology as the effects of the 
pseudo-Riemannian geometry of noncompact Lie groups. 

Existing works on control of classical mechanical systems generally do not require 



direct control of the system propagators 



except some special cases (e.g., robotic motion 



planning on the Euclidean group SE{3) It is usually sufficient to attain the control 



over state vector with fewer degrees of freedom. However, since any given state vector in 
phase space is associated with an infinite number of symplectic matrices that propagate 
the initial state of the system to the desired final state, it is generally impossible to predict 
which of these symplectic matrices will be reached by the time-dependent control obtained 
through the optimization procedureAs such, the efficiency of control optimization will be 
highly system-dependent, with optimization algorithms traversing longer trajectories in 
the symplectic group for certain classes of Hamiltonians. In contrast, if the control 
problem is cast in terms of symplectic propagator optimization, it is possible to choose 
the shortest path in the symplectic group from the initial condition to the target [21]. As 
such, gradient control algorithms based on propagator optimization may outperform those 
based on state vector optimization. Optimization algorithms of this type are currently 
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the subject of intense study in the context of quantum control 

In a study of optimization algorithms on noncompact Lie groups, Mahony indicated 
that (local) quadratic convergence can still be achieved by using the Newton method 
adapted for the curved manifold under local coordinates of the ffist kind j?], which results 
in no essential differences compared to algorithms on compact Lie groups. However, the 
global topology of the optima and suboptima may play a fundamental role in the overall 
efficiency of the optimizations, and will be the major concern of our studies here. This 
paper is organized as follows. Section II summarizes the definition and properties of 
symplectic groups. Section HI derives the canonical form of landscape critical points. 
Section IV analyzes the Hessian quadratic form for each of these critical points. Section 
V studies the constrained landscape over the compact subgroup. Section VI provides an 
illustrative example. Finally, Section VII draws the conclusion. 
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II. PRELIMINARIES ON THE SYMPLECTIC GROUP 



In classical mechanics, a transformation for a system described by pairs of coordinate 
and momentum variables = (a;^^^, ■ ■ ■ ,x^^/3',Pa^i3, ■ ■ ■ ,Pa,i3) called symplectic if it 
preserves the (skew-symmetric) symplectic form 



N 

(JJ{Z. 



a J 



J 



With this coordinate system, a symplectic transformation can be represented by a 2N x 
2N dimensional real matrix that satisfies S^JS = J, where S'^ is the transpose of S and 

In 

-In 

The set of symplectic matrices forms a noncompact Lie group Sp(2A^, M), and its Lie 
algebra sp(2A^,R) = {JB\B'^ = B, Be R^^''^'^}, from which it is easy to see that 
the dimension of Sp(2A^, M) is N{2N + 1). As a linear vector space, sp(2A^, M) can be 
decomposed into two mutually orthogonal subspaces sp(2A^, M) = £1 © C2, where 

£1 = {JB esp{2N,R)\JB = -BJ}, 
£2 = {JB e sp{2N,R)\JB = BJ}. 

The subspace C2 is a Lie subalgebra of sp(2A^, M). It generates the orthogonal symplec- 
tic group 0Sp(2A, M) as the maximal compact Lie subgroup of Sp(2A^, M), as it is the 
intersection of the symplectic group Sp(2iV, R) with the orthogonal group 0{2N). There 
is an interesting isomorphism between 0Sp(2iV, R) and the unitary group U{N) via the 
following mapping: 

( X y\ 

X-iY^\ , A'-!y'eM(JV). (2) 

\-Y X) 

Here we briefly summarize some properties of symplectic matrices and symplectic 
groups that will be used in the following analysis. Readers of interests are referred to 9| 
for more details. 



Propoerty 1 As the analog of the property O ^ = for any orthogonal matrix O, 
S^^ = J^S^J for any symplectic matrix S. 



Propoerty 2 The eigenvalues of a symplectic matrix always appear in reciprocal pairs, 
i.e., if uo is an eigenvalue of S, then so is uj~^ , and they have identical degeneracy degrees. 

Propoerty 3 There always exists a symplectic singular value decomposition (SVD) S = 
UDV , where U and V are orthogonal symplectic matrices. D is a diagonal symplectic 
matrix whose diagonal elements are the singular values of S. 

Propoerty 4 Denote by Stab(D) = {R e 0Sp(2iV,R) : R^DR = D} the stabilizer of 
the diagonal symplectic matrix D in the group 0Sp(2A^, R). The stabilizer of a diagonal 
symplectic matrix 

where 1 = oq < ai < 02 ■ ■ ■ < cir, is the direct product of subgroups Stab(D) = 
OSp(2no, M) X 0{ni) x ■ ■ ■ x O(n^). 

The above features can be demonstrated by the example of the 2-qunit SUM eate 

n n 

in continuous quantum computation [8|, [16|; 
{11,(12; Pi, P2) as follows 



17l |. which acts on the quadratic vector 



SUM : Qu qi + q2, Pi^pi- P2, P2 P2- 



The matrix form of the SUM gate is 



SUM 



/ 1 \ 
110 
1-1 
\0 1 J 



(3) 
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whose singular value decomposition SUM = UEV can be found to be 



U 



-r] 

-7] ^ 

-V 

V -V ^ J 



where 
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Since there is a two- fold degeneracy of the singular value a;, the stabilizer of E is isomor- 
phic to the 0(2) group. 



III. CANONICAL FORM OF THE CRITICAL SUBMANIFOLDS 

Any candidate solution to the optimization problem ([T])must be one of its critical 
points, defined as a 5* G Sp(2A^, M) such that the gradient VJ(S'*) of the cost function 
vanishes. Since there generally exist multiple-solutions for the critical points, a complete 
understanding of the critical topology is of essential importance to assess the complexity 
of searching for the global optimal solution. 

The basic idea to determine the set of critical solutions is, at an arbitrary fixed point 
S G Sp(2A^, M), to perturb the cost function along an arbitrary direction in the tangent 
space (isomorphic to sp(2A^, M)) and find those points where the directional derivation 
vanishes along all directions. For example, taking the parametrization Se^'^^ with = Y 
(here JY represents the local Cartesian coordinates in the tangent space at S), the critical 
condition can be obtained by forcing the derivative along JY to be zero at t = for 
arbitrary Y, i.e., 

d 



J{Se"^) = Tt[JY{S' S - W' S)] = 0, V Y' = Y, 



To TJ/T CM n W 



t=0 
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which imphes that the matrix [S'^S — W'^S)J has to be skew-symmetric, i.e., 

{S^S- W'^S) J=J{S^S- S^W) . (4) 

Left multiplying a constant matrix and applying the property J^S^J = S^^, we get 
a simpler form: 

S^S - {S^S)-^ = S^W - {S^W)-\ (5) 

Although ([5]) is a nonlinear (fourth-order in S) equation, its highly symmetric form 
makes it still solvable. Let S = UiDVi be a symplectic SVD of 5*, and W = UoEdVo be 
that of W. Substituting these SVDs into the equation (Q, we can simplify the condition 

as 

- D-^ = DE - {DEy^, 

where E = UE^V with U = UiUq and V = VqV^. Followed by a commutation with 
DE on both sides, this equation is further transformed as [DE, — D""^] = 0, or 
equivalently, [E, — D^^] = 0. We will show that this relation implies [E, D] = 0. 
Let dr > ■ ■ ■ > di > 1 > d^^ > ■ ■ ■ < d~^ be the distinct eigenvalues of D, where the 
degeneracy of di (or d^^) is Mi, i = 1, ■ ■ ■ ,r; then D can be decomposed into diagonal 
blocks Di = dilMii i = - ■ ■ and their inverses. The commutativity of E and D^ — D^^ 
implies that E is simultaneously block- diagonal with — D~^, corresponding to sub- 
blocks (d^ — dj'^)lMi- Since D is positive definite, two distinct eigenvalues of D must 
correspond to two distinct eigenvalues of — D^^, and vice versa. So D shares the same 
eigenspace decomposition with — D~^, as well as that of E. Hence E commutes with 
D. 

Therefore, for each Di in D, there corresponds a diagonal block Ei of E . Let Ei = 
UiEoiVi be a SVD of Ei, then the following block-diagonal symplectic orthogonal matrices 

Ud = diag{Ui, ■■■ ,Ur] , ■■■ , Uj}, 
Vd = dtag{V,,--- ,Vr;V,^,--- ,V,^}, 
Ed = diag{Eoi, ■ ■ ■ , Eor; Eq^, ■ ■ ■ , -Eq"/} 
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define a SVD E = UdEdVd of E. Hence, for any general SVD E = UEaV, the following 
relationship 

iUjU)E,iVV^) = E,, 

implies the existence of a symplectic orthogonal matrix R G Stab(£'rf) such that U = UdR 
and V = R^Vd. 

Going back to the original symplectic matrix S*, we then have a uniform expression 
for the critical solutions: 

S* = UiDVi = UoU^DV^Vo = UoR^ {Uj DVj)RVo. 

Notice that f/J commutes with by which we can denote L = UjVj , and then simplify 
the representation of the critical points as folows: 

S* = UoR^PRVo, P = DL, Re Stab(Ed). (6) 

This canonical form shows that the critical manifold consists of orbits of admissible 
matrices P under the action of Stab{Eii), which can be represented by the quotient set 
A4 = Stab (E'rf) /Stab (P). It is then sufficient to characterize the set of critical points by 
specify all possible values of the characteristic matrix P that involves the singular values 
di and their corresponding orthogonal matrix blocks Lj = U^V^^. 

For simplicity, owing to the reciprocal properties of singular values of symplectic ma- 
trices, we only need to analyze the singular values that are no less than 1. Restricting 
the matrix equation on the eigenspace of each di> 1, and substituting the canonical 
form into we have 

(d^ - df)h.u = d,E,,U - d;'LjE-^. (7) 
Let ei, ■ ■ ■ , be the singular values of EQi. This equation can be decomposed as 

diCaU^ap - d^^e^^Li^pa = {dj - d^'^)6ai3, a,P = 1, - ■ ■ ,Mi, (8) 
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where Li^a/s is the a(3-th matrix element of Lj. Using equation ([HD, we classify the sub- 
blocks in P into the following different types. 

Firstly, suppose that the orthogonal matrix Lj is diagonal, then each element Li^^a 
has to be unimodular. When Li^aa = 1, we get 

(rf, -e^)(4 + e-^) = 0, 

whose only admissible positive root is di = e^, where > 1. The case Li^aa = — 1 
corresponds to 

{di + e^){4 - e-^) = 0, 

— 1/3 

of which the only admissible positive root is di = e-y where e-y < 1. 

In such cases, each block Ei allows for only one singular value so that all eigenvalues 
of Di are identical. Corresponding the case Li = Im^, the block Di is called type I; for 
the case Li = —Ihii, the block Di is called type II. 

For the more general case that Li is not diagonal, any pair of nonzero off-diagonal 
matrix elements must satisfy 





(9) 



-d~^e-^ diCfs 

in which the determinant of the coefficient matrix has to vanish, and this solves the 
eigenvalue di, i.e., 

d^iCaCfs - {dle^epY^ = di = [e^eaY^I'^ . (10) 

Consequently, substituting (fTOj) back into the equation ([9]), we find that the resulting 
nonzero off-diagonal matrix element Lj^^/? = -Z^i,/3o, i-e., the matrix Li must be symmetric. 
Obviously, in such cases each minimal block Di allows for exactly two distinct singular 
values of Ei (otherwise Di will have non- unique singular values), and their repeating 
number are both ki = Mi/2. 
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Without loss of generality, we assume that Cq, > ep and di = {eaCp) > 1. Then the 
use of ( JTOjl solves the corresponding diagonal elements from ([8]) as follows 

As Li is orthogonal, each of its matrix elements must satisfy < Li^ap < 1, which set 
additional constraints on the admissible pairs and that generate di. 

' < e„ < ep\ (11) 

Under such conditions, the corresponding block Di is called a type III block. Suppose 
that Ei = diag{ealki, ^ishi}, the corresponding matrix Li must be in the following form: 

cosxj sinxj Of. 
sin Xi Ok, - cos Xi h. 



(e^/e^)2-(e^/e„) ^ ^ ' 2 



where O is some orthogonal matrix and the angle Xi = arccos — (£^£^} e [0 

Let T = diag{0, O}, then P^LiP is in the following canonical form 

cos Xilk^ sin Xj Jfc. 
sin Xj/fc. — cos Xj/fc. 

Since the transformation matrix T is in the stabilizer Stab(£'rf), so Li can be always 
represented by the above standard form. 

In conclusion, suppose that W has s greater-than-1 singular values 1 < Ui < 002 < 
■ ■ ■ < uJs with degeneracy degrees rii, ■ ■ ■ ,ns. From the above analyses, each given singular 
value uja > 1 can be used to produce a singular value di in the canonical form P through 
the following three ways: 

I. di = oja with multiplicity and a corresponding matrix block P'^ = Ualm'^ in P; 
_i 

II. di = Ua^ < ^ with multiplicity and a corresponding matrix block P^ = 



-Ua ^ Im'l in P', 
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III. di = [ijja/ujfj) ^^"^ with multiplicity 2m'^p, where Ua ^ ujf3 (defined as that 

1 /3 

^13 < < ^13), which are from m'^'^ eigenvalues a;„ and m'^p eigenvalues LOp. 
The corresponding block is 



p/ir 

^af3 



CJ^ \ 2 / COS Xafulm'^'^ Sm Xafslm'^'^ 



\ sinXapIni"' -COSXapIni"' 



where Xap = arccos . 

The singular value cjq = 1 can be treated separately. Since the condition (fTT!) can never 
be satisfied with uq = 1 and any other singular values of E, uo = 1 is always incapable of 
generating a type III singular value di ^ 1 via (ITOl) . Thus, they only contributes to type 
I or II singular values = 1 of Z^. Let the degeneracy number of ujq = 1, which must be 
even, be 2no, and the number of type I singular values is mo, the possible characteristic 
matrix block Pq is 

Po = j I , mo = 0, ■ ■ ■ ,no. 

y ~Infi-mo j 

Any group of admissible indices {mo; m^, m^', m^'^; a, /5 = 1, ■ ■ ■ , s} labels an orbit of 
Stab(-E(i), which equivalently labels a unique critical submanifold of the set of critical 
points. The value of the cost function at these critical submanifolds are: 

IV. TOPOLOGY ANALYSIS OF CRITICAL SUBMANIFOLDS 

This section will delve into more intrinsic topological details of the critical manifolds, 
including (1) their connectedness, determined by counting the number of separate sub- 
manifolds, (2) their dimensions and local optimality status (i.e, local maximum, minimum 
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or saddle point), determined via Hessian analysis. Such information provides a global 
picture of the distribution of possible solutions and their influences on the actual search 
for optimal solutions to the optimization problem. 

The number of critical submanifolds can be enumerated by counting all admissible 
combinations of indices {mo; m^, m^, m^'^; a, /5 = 1, ■ ■ ■ ,s}, each of which corresponds 
to a unique characteristic matrix P, and hence labels a critical submanifold as the orbit 
of P under the action of Stab(i?rf) [22]. This number is dependent with the degenerate 
structure of the singular values of the target transformation W. For example, the simplest 
case is that W is an orthogonal symplectic matrix, where Ed = I2N and P has only ±1 
singular values. The total number of critical submanifolds is + 1 corresponding to 
m = 0, 1, ■ ■ ■ , A^, the repeating number of the —1 eigenvalues in Pq. When W has a fully 
degenerate singular value > 1, the admissible characteristic matrices P may contain 
either of the I-III types of sub-blocks, and hence there are more critical submanifolds. 
Let m'" be the number of pairs that generate 2m'" type III singular values = 1 in Z), 
m' for d = to and m" for d = to^^'^. Since m' + m" + 2m'" = A^, counting such admissible 
combinations gives the number of critical submanifolds as a quadratic function of A^: 



The number of critical submanifolds shoots up when the degeneracy in W is broken 
up. The extremal case is that Ed is fully non-degenerate and the singular values of E^ 
are not far apart from each other such that uJa ^ ujp for any a < (3, i.e., any two distinct 
singular values of W are allowed to produce a pair of type III singular values of S* . Let 
m be the number of pairs of singular values of W that generate type III singular values 
of S*\ then there are A^!/[(A^ — 2m)!m!2™] different choices. Moreover, for each fixed m, 
the possibilities of using the remaining singular values to generate I or II type singular 




A^ even; 



A^ odd; 



(12) 
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values of S* is 2^ These set an upper bound 

2N—3m 



m\{N -2m)V 

which is super-exponential in A'^, on the maximal number of critical submanifolds in all 
cases. 

The dimensions of the critical submanifolds are generally difficult to calculate. How- 
ever, are simple for those that contains only type I and II blocks (i.e., m'^i^ — 0, for all 
a, /5) via their geometrical expression A4 — Stab(£'d)/Stab(P), i.e., 

dimAl = dimStab(£;d) - dimStab(P). 

As stated in Section II, Stab{Ed) is the product of orthogonal subgroups 0{nfj) and a 
symplectic orthogonal group OSp(2no,lR) (for ujq — 1). The stabihzer of P is a Lie 
subgroup of Stab(£^rf), which is the product of 0{m'^ and 0{rn"^ for type I and II 
singular values. Therefore, such critical submanifolds can be represented as 

QSp(2no,R) A OK) 

OSp(2mo, K) X OSp(2no - 2mo, M) 0{m'^) x 0{ml) ' 

and their dimensions can be easily evaluated as 

s 

V = 2mo(no - mo) + ^ m'^rn'l^- (13) 

The optimality status of these critical submanifolds can be acquired from analysis 
of the local geometric structure for each of the critical submanifolds via their Hessian 
quadratic form (HQF). The numbers of positive, negative and zero Hessian eigenvalues 
determine the optimality status, i.e., a critical point is a local minimum (maximum) if 
all the eigenvalues are positive (negative), otherwise it is a saddle point. The HQF is 
defined as the second-order term of Y in the Taylor expansion of the parametrization 
g^tJY ^ which is dominant in the neighborhood of -S"* while the first-order term vanishes 
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at S. It is not difficult to obtain that 

H{Y) = Tr [JY{S^^^S - W^S)JY + JYS^S{JYf] 

= Tr [JYVj'R^iP^ - EdP)RVoJY + JYVj' RVo(JYf] 

Notice that (1) P^ = because L commutes with D and L is symmetric orthogonal; 
(2) JRVo = RVoJ, we may transform Y into X = {RVQ)Y{RVQf and rewrite the HQF 
as 

H{X) = Tr [JX{D^ - DEdLd)JX + JXD^{JXf]. (14) 

Let X be the vector of independent variables in X, then Ti-^X) can be written as a 
quadratic form x-^Qx, where Q is a symmetric N{2N + 1) x N{2N + 1) matrix. The 
Hessian eigenvalues are defined as the eigenvalues of the matrix Q. 

Let D = diag{Q,Q~^}, DEa = diag{il,il~^} and L = diag{^,^}. Dividing the 
symmetric matrix X as 

f A 
\C B 

where A and B are symmetric, we may rewrite the HQF as the function of A, B and C, 
i.e., 

n{A, B, C) = Tr(Ae^A - 2AT.B + BQ-^B) + Tr{Ce^C^ + 2CEC + C^Q-^C), 

where E = (G^ + Q-^ - f]$ - m-^)/2 = 9^ - f]$ (the proof of the second "=" is 
nontrivial but will be omitted here). 

For illustration, we carry out the Hessian analysis for critical submanifolds that con- 
tain only type I and II singular values, where the corresponding © and E are diagonal. 
Moreover, we assume that cuj > 1 for alH = 1, • • • , A^, and the spectrum of Ed is so widely 
spaced that cui cuj for any cui < cuj (the other cases not involving type HI singular values 
can be dealt with as well but are relatively cumbersome) . Now suppose that the diagonal 
elements in © are ordered as 
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Then the Hessian form can be decomposed into TC{A, B, C) — Hi{A, B, C) +T-C2{A, B, C) 
with 



^ 2 

'2^ ' 



+ E 



l<i<j<N 



{ai+(Tj)cji 



+ 



n 2 



+ + (£^tf^ 



(15) 



+ E 



l<i<j<Af 



2\ ((Ti+(Tj)2 



+ 



62. 



(16) 



where a^, 6^ and are matrix elements of ^4, i? and C; and cr^ are diagonal matrix 
elements of © and E. The expressions consisting of square terms of independent variables 
actually represent the local coordinate system in which the HQF is diagonalized. This 
can be used to count the number of positive (negative or zero) Hessian eigenvalues by 
examining the signs of these square terms. 

The first part 'H\{X) contains N'^ + N positive definite terms with respect to any 
choice of X, and hence it provides N"^ + N positive Hessian eigenvalues. The terms 
in the (positive indefinite) second part 'H2{X) needs to be further analyzed. It is easy to 
see that the coefficients of the first N terms (1 — (^j)dj^b'jj, j = 1, ■ ■ ■ , N, are positive for 
type I singular values dj where aj — 0, and negative for type H singular values dj where 
aj = + dj'^ > 2. They provide N' = EI=iK positive and N" = J2t=i'^i Hessian 
eigenvalues. 

The signs for the remaining terms arc determined by the discriminants A'- = {df + 
df){df + d]) - {a, + a^f and A'l^ = {dj + d]){df + df) - {a, + a,)\ where i < j, 
whose signs correspond to that of Hessian eigenvalues in the coordinates of and bij, 
respectively. 

(1) When both di and dj are of type I, cTj = cTj = and hence both A'-j and A'lj are 
positive. This produces N'{N' — 1) positive Hessian eigenvalues. 
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(2) When both di and dj are of type II, the value of A-^ is: 

= {d^ + df){df + d]) - {d1 + df + d] + dff 

= -[{df + cij)^ + {d1 + + dj) + {d1 + ci-2)2] < 0, 

from which we can see that the corresponding Hessian eigenvalues are all negative. The 
same holds for Af^-. In total, this produces N"[N" — 1) negative Hessian eigenvalues. 

(3) When di is of type I and dj is of type II, the discriminants become: 

= {di + df){df + d]) - [d] + dff = dfidi - d]){d] - df), 
- {dl + d]){df + df) -{d] + df)^ = d%dl - df){df - df). 

Because the di and dj are always chosen to be greater than 1, it is easy to see that 
d^j — df > and df — df > except when di — dj — 1. Hence the signs of the 
discriminants are determined by 

A' j2 72 ,2 , 2/3 A // 7-6 7-2 , -2 , -2 

\j ~ - dj ^u^- Uj' , Aj^. ~ dj -d^ = u)j - . 

A^^- is positive only when uji ■< cuj, and otherwise negative. However, by assumption 
this holds only when a;, = cUj, which brings J2a^'a''^'a positive Hessian eigenvalues and 
X]a</3^a^/3 ncgativc Hcssian eigenvalues. The discriminant A^^- < for Ui < LVj, which 
brings ^cx</3 negative, and fT^'a^'L Hessian eigenvalues. 

(4) When di is of type II and dj is of type I, we may derive: 

Mij = {di + df){df + d]) - {di + dff = df{d] - di){dt - df), 
= {di + d]){df + df) - {di + dff = dt{d] - df){df - df), 

which lead to the similar criteria 

A^~4-d^ = u;|-u;f >0, i^lj^d^'-df^uf-uf>Q. 

The Hessian eigenvalues in this case are all positive and its number is 2 'Ylia<i3 "^a^is- 
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In conclusion, the total number of positive, negative and null Hessian eigenvalues can 
be summated as follows: 

r 

Do = Y.«^ (1^) 

Q = l 

r 

V+ = + N + N'^ + Y, « + 2 J] Km'p, (18) 

a=l a<l3 

= N"^ + 2j2^Wp- (19) 

a<l3 

From these formulas, it is easy to see that, among these critical submanifolds, there is 
only one local minimum 5** = in the landscape whose singular values are all of type 
I. Thehe rest of them are all saddle submanifolds because both D+ and D„ are nonzero. 
The same conclusion can be drawn for other critical submanifolds that have no type III 
blocks. 

The Hessian analysis for critical submanifolds involving type HI singular values is more 
complicated and any analytic formula is not available so far. However, it is not difficult 
to prove that they are all saddle submanifolds. So we may conclude the main theorem in 
this paper: 

Theorem 1 The optimization problem (J\) has a unique minimum S* = W and the rest 
of the critical submanifolds are all saddles. 

Proof: It is sufficient to prove that critical submanifolds involving type HI singular values 
have saddle structures, i.e., the corresponding Hessian form is neither positive or negative 
definite, or equivalently, there exist some X' 7^ and X" 7^ such that H{X') > and 
7i(X") < 0. Let Ai be such a critical submanifold, and S = diag{Tio,^i}, where 
Eq is a 2ki x 2ki type HI block and S2 contains the rest blocks. Choose a particular 
A = diag{Ao; 0} (similarly for B and C), where Aq corresponds to Sq and to Si, such 
that the resulting Hessian quadratic form are irrelevant to Si, i.e., 

n{Ao, Bo, Co) = Tr(Aoe^Ao-2AoSoSo+5oeo '5o)+Tr(CoegCo^+2CoSoCo+Co^0o 'Co). 
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Here the sub-block 

/ {dj - cos Xi) Ik, --fisinxih, 
So = I 

y -7^1 sin xJk, {d?i + li^ cos Xi)Ik, 

where 7, = {uJaUJp) ^ with Ua and cup being the pair of singular values of W that generates 
the singular value di = {cu^/uJaY^'^- The matrix 60 = dil2ki- 

Now choose Aq = I2, Bq = A/2 and Cq = 0, where A is to be determined. Then 

HiAo, Bo, Co) = Triel + 2ASo + A^Gq = 2A;,[(rf^ + A^') - A(2rf^ - (7, - 7-I) cosx^)]. 

According to the definition cosxj = (df — d^'^)/{'ji — 7^"^), the Hessian can be simplified 
as H^Aq, B'q, Cq) = 2/cj(l — X)df. So the corresponding H{Xo) is positive (resp., negative) 
when A < 1 (resp., A > 1 ), which implies that the Hessian is neither positive nor negative 
definite. End of proof. 

V. CRITICAL LANDSCAPE TOPOLOGY CONSTRAINED ON THE COM- 
PACT SYMPLECTIC GROUP 

Carrying out optimal control field searches over only the compact subgroup 
0Sp(2A^, M) is also important in many circumstances, e.g., using only linear quantum 
optics to search for a symplectic quantum gate ^. The derivation of the topology is sim- 
ilar to that for the landscapes on U{N) described above, since 0Sp(2A^, M) is isomorphic 
to U{N) [18]. The Lie algebra of 0Sp(2A^, M) consists of matrices of the form 

osp(2A^, M) = {A = JY = r, JY = YJ}. 

The condition for S* to be a critical point in the constrained landscape is 

Tt{W'^SJY) = 0, V jy G osp(2Ar, R), 

which can only hold if the matrix W^SJ is an element of the space complementary to 
that of B, which is equivalent to requiring that W'^S is an element of the Jordan algebra 
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of 0Sp(2A'", R)). W'^SJ must then simultaneously satisfy the two conditions 



WSJ = -{W'^SJf = JS'W, JiW'SJ) = {W'''SJ)J, 



\ 



0<m< N. 



which implies that W^S — S^W. This equation can be rearranged to give S — W^/I2N, 
where y/hn must lie within the group 0Sp(2A'", R). Because the cost functional is invari- 
ant with respect to the conjugation action of 0Sp(2A^, M), the solutions correspond to a 
set of 0Sp(2iV,M) orbits, i.e., S* = WR^D^R, where R e 0Sp(2Ar,M) and 

iN-m 

iN-m J 

There arc then + 1 solutions, with values of the cost functional JT" = 0,8, 16, ...,8N. 
The minimum and maximum values of correspond to 5" = and 5" = —W, re- 
spectively. The critical manifolds can be expressed as Grassmannian cosets M. — 
0Sp(2iV,M)/Stab(L'^), where Stab(L'^) = 0Sp(2m,M) x 0Sp(2(iV - m),M), so that 

, . 0Sp(2iV, R) 

^ ~ 0Sp(2m, R) X OSp(2(7V - m) , R) ' 

The HQF can be calculated by parameterizing the argument of ^{3) — Tr(1^^5') via 
Se"^^ as in the above. Taylor expanding the landscape function and keeping only the 
second-order term, we get the HQF, 

H{Y) = TT[W^S*{JYf] = TT[R^D^R{JYf] = Ti[{J RY R^f D^{JRY R^)]. 



Let X = RYR^ , which still satisfies the conditions X'^ — X and XJ— JX. X can be 
expressed in the form 

^ A C 
-C A 



X 
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where = A and = —C are A^-dimensional matrices. Let aij = aji and Cjj = —Cji 
are the matrix elements of A and C. Since any S* is represented by a corresponding 
matrix D, we obtain the following polynomial expression for the HQF: 

H{X) = -25^ a^y, - 2 ^ (a^, + + Si), 

j=l,N l<i<j<N 

where 6j is the j-th diagonal element of D^- It can then be verified that the landscape 
on the homogenous compact symplectic group has identical critical topology to the (uni- 
tary) transformation landscape on U{N) [19], with the following breakdown of Hessian 
eigenvalues for m = 0, ■ ■ ■ , A^: 

V+ = {N -mf, V_=m^, Vo = 2m{N-m). (20) 

As in the case of the optimization over the full symplectic group, the critical topology 
for compact target symplectic gates also consists of orbits of orthogonal symplectic groups, 
whose numbers of Hessian eigenvalues are 

V+ = + N + {N -mf, D_ = m^ Vo = 2m{N -m), (21) 

where m is defined in the standardized block Pq in Section HI. This shows that the critical 
topologies are very close between the full symplectic group and its compact subgroup, 
except for the A^^ + A^ difference in the number of positive Hessian eigenvalues. By this 
difference, the non-optimal critical points for Sp(2A^, M) are all saddle points, while one 
of 0Sp(2A^, R) is a minimal point. 

VI. EXAMPLES 

Consider the SUM gate that has one two-fold degenerate singular value lu = (-\/5 + 
l)/2. The analysis predicts that there are 4 critical submanifolds for this gate. The 
first one is the global minimum point SI = SUM, whose characteristic matrix Pi = 
diag{uj,uj^^,uj^^,uj} contains one type I block (m' = 2, m" = 0). 
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The second critical submanifold is an isolated saddle point, whose characteristic matrix 
P2 = diag{—uj~^/^ , -a;^/^, —u^/^, -o;"^/^} contains one type II block (m' = 0, m" = 2), 
and the corresponding critical point is 

/ -0.906 0.614 \ 

-0.292 -0.906 
-0.906 0.292 

-0.614 -0.906 



UP2V 



\ 



I 



As to the third one, where P3 = diag{uj^—ijj^l'^ ^—uj'^^^^ contains one type I 
and one type II blocks [rn! — 1, m" = 1), the corresponding critical submanifold is 
one-dimensional as the orbit of the 0(2) symmetry group of E. Parameterize Stab(£'d) 
as: 

/ cos^ sin^ ^ 
Tsin^ ±cos^ 
cos 6' =Fsin6' 

\ sin^ icos^ 

where the signs ± correspond to the two disjoint parts of 0(2). The critical submanifold 
can be expressed as an orbit of 0(2) group: 



ee[0,27r). 



0.152 cos + 0.047 0.990 cos + 0.307 0.307 sin 0.953 sin 

1.141 cos 61 + 0.354 0.152 cos 6* + 0.047 -0.953 sin 6* 0.646 sin 6* 

-0.646 sine -0.953 sin 0.047 - 0.152 cos ^ -0.354 + 1.141 cos 

0.953 sine -0.307 sin -0.307 + 0.990 cos 0.047 - 0.152 cos 



where the orbits of the two disjoint parts coincide with each other. 

The last critical submanifold contains a type III block {wl — 0, w!' — and wl" — 1). 
This block and its corresponding critical matrix are given by: 



22 



V 



^4* 



1 o^ 

10 
1 
10 

The Hessian analysis can be done for the first three critical points (submanifolds) with 
formula given in Section III. Here we exemplify the Hessian analysis with the critical point 
5*4, for which we don't have an explicit counting formula yet. Using expression (IT^ and 
the decomposition of X into {A, B, C), we have Hi^X) = Ti.i{A, B) + Ti.2{C), where 



/i ^ 

0-100 
10 

\o -1 y 



Hi (A 5) 



n2{c) 



al-^ + 2a^2 + + &ii + 2bl^ + h: 
+2uj^^(ai2bii + a22&i2) - 2a22&: 



2 

22 



2aii&ii - 4ai26i2 + 2uj{aiibi2 + 012622) 
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4c^i + 2c^2 + 2c2i + 4c22 - 2u;(ciiC2i + C22C21) - 2a; ^(ci2C22 + C11C12) + 4c2iCi2 



Denoting by x = (an, ai2, 022, &11, &12, ^22, cn, C12, C21, C22) the vector of independent 
variables, the Hessian form can be expressed as a quadratic polynomial x^Qx, where 



Q 





1 








-1 


UJ 






















2 







-2 


UJ 






















1 







-1 
















-1 







1 






















UJ 


-2 







2 






















UJ 


-1 








1 


































4 


-uj-^ 


— UJ 



























2 


2 


—UJ~ 






















—UJ 


2 


2 


—UJ 


V 























-uj-^ 


— UJ 
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is a block-diagonal 10-dimensional symmetric matrix. Numerical calculation shows that 
upper block corresponding to T-Ci{A,B) offers 4 positive and 2 negative Hessian eigen- 
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values; the lower block corresponding to 7^2 (C*) offers 2 positive and 1 negative Hessian 
eigenvalues. Hence the HQF has 7 positive and 3 negative eigenvalues. 

In summary, there are a total of six critical submanifolds including 3 isolated points 
and two one-dimensional manifolds. The Hessian analyses are summarized in Table I. 



No. 


Critical value 


^0 


^+ 


P_ 


type 


1 








10 





minimum 


2 


18.623 





6 


4 


saddle 


3 


9.311 


1 


8 


1 


saddle 


4 


10 





7 


3 


saddle 



TABLE I: Landscape characteristics for the SUM gate. 



VII. CONCLUSION 

We have resolved the critical solutions for least square problems on the symplectic 
group. The critical topology of this nonlinear optimization problem over a noncompact 
Lie group was shown to be of high complexity compared to that of analogous problems 
on compact Lie groups. However, the topology is still devoid of multiple local extrema, 
and the critical solutions consist of a finite number of critical submanifolds which are 
within a bounded region. These results have important applications to the study of 
control landscapes 2(J\ for classical mechanical systems or continuous variable quantum 
computation systems, implying that the search of optimal controls would encounter no 
essential obstructions. 

Due to the noncompactness of the symplectic group, the optimal implementation of 
symplectic transformations (or symplectic gates in continuous variable quantum compu- 
tation) might be more inefficient than that of unitary transformations (e.g., those applied 
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in discrete variable quantum computation). Nonetheless, recent OCT simulations using 
this objective function js] verify the prediction that local gradient-based algorithms will 
converge due to the lack of local traps in the landscape. 



APPENDIX A: STABILIZERS OF SYMPLECTIC MATRICES 

Here we give the structures of stabilizers of several kinds of symplectic matrices en- 
countered in this paper. The blocks Da = diag{aln] a^^/„} will be frequently encountered 
corresponding to a reciprocal pair of singular values a and a~^. We can substitute the 
standard form of R into the definition, which gives 



It is easy to see that, when a = 1, any matrix R G 0Sp(2n,M) is in the stabilizer. For 
a ^ 1, Y has to be zero, and hence leaves R = diag{X; X} where X G 0{n). So, the 
stabilizer for such D is isomorphic to 0{n). 
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